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Abstract 

This paper formalizes the local density inequality approach to getting upper bounds for sphere 
packing densities in K". This approach was first suggested by L. Fejes-Toth in 1954 as a method to 
prove the Kepler conjecture that the densest packing of unit spheres in has density which 
is attained by the "cannonball packing." Local density inequalities give upper bounds for the sphere 
packing density formulated as an optimization problem of a nonlinear function over a compact set in 
a finite dimensional Euclidean space. The approaches of L. Fejes-Toth, of W.-Y. Hsiang, and of T. C. 
Hales, to the Kepler conjecture are each based on (different) local density inequalities. Recently T. C. 
Hales, together with S. P. Ferguson, has presented extensive details carrying out a modified version of 
the Hales approach to prove the Kepler conjecture. We describe the particular local density inequality 
underlying the Hales and Ferguson approach to prove Kepler's conjecture and sketch some features of 
their proof. 
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1. Introduction 

The Kepler conjecture was stated by Kepler in 1611. It asserts that the face-centered cubic 
lattice gives the tightest possible packing of unit spheres in . 

Kepler Conjecture. Any packing of unit spheres in has upper packing density 

pip) < ~ 0.740480 . (1.1) 
^ Vl8 



The definition of upper packing density is given in §2. The problem of proving the Kepler 



conjecture appears as part of Hilbert's 18th problem, see [Hi|. 

T. C. Hales has described an approach for proving Kepler's conjecture, and has announced 
a proof, completed with the aid of S. Ferguson, which is currently presented in a set of six 
preprints. The proof is computer-intensive, and involves checking over 5000 subproblems. 
The Hales approach is similar to earlier approaches in that it aims to prove a local density 
inequality that gives a (sharp) upper bound on the density. It involves several new ideas which 
are indicated in §4 and §5. 

Local density inequalities obtain upper bounds for the sphere packing constant via an 
auxiliary nonlinear optimization problem over a compact set of "local configurations". They 
measure a "local density" in the neighborhood of each sphere center separately. The general 
approach to the Kepler conjecture is first to find a local optimization problem that actually 
attains the optimal bound (assuming that one exists), and then to prove it. This approach 



was first suggested in the early 1950's by L. Fejes-T6th|FT , pp. 174-181], who presented some 
evidence that an optimal local density inequality might exist in three dimensions. 
The objects of this paper are: 

(i) To formulate local density inequalities for sphere packings in M", in sufficient generality 
to include the known candidates for optimal local inequalities. 

(ii) To review the history of local density inequalities for three dimensional sphere packing 
and the Kepler conjecture. 

(iii) To give a precise statement of the local density inequality considered in the Hales- 
Ferguson approach. 

(iv) To outline some of features of the Hales- Ferguson proof. 

In §2 we present a general framework for local density inequalities, which is valid in M", 
This framework is sufficient to cover the approaches of L. Fejes-Toth, 
and T. Hales and S. Ferguson to Kepler's conjecture. A different framework 
for local density inequalities appears in Oesterle [Oes]. In §3 we review the history of work 



given as Theorem 2.1 
W.-Y. Hsiang 



on local optimization inequalities for Kepler's conjecture. In §4 we describe the precise local 



optimization problem formulated by Ferguson and Hales in [ FH ] , which putatively attains 



In §5 we make remarks on some details of the proof strategy taken in the papers of Hales [3P-I|, 
| SP-II |, | SP-III |, | SP-IV |, Hales and Ferguson |FH], and Ferguson [ SP-V ]. In §6 we make some 
concluding remarks. 

The current status of the Hales-Ferguson proof is that it appears to be sound. The proof 
has reputedly been examined in fairly careful detail by a team of reviewers, but it is so long and 
complicated that it seems difficult for any one person to check it. This paper is intended as an 
aid in understanding the overall structure of the Hales-Ferguson proof approach. For another 
account of the Hales and Ferguson work, see Oesterle [ Oes |. For Hales' own perspective, see 



Hales P00 |. 

Two appendices are included which contain some information relevant to the Hales-Ferguson 
proof. Appendix A describes some of the Hales-Ferguson scoring functions. Appendix B lists 
references in the Hales and Ferguson preprints for proofs of lemmas and theorems stated with- 
out proof in §4 and §5. 

This paper is a slightly revised version of the manuscript [La99|. 
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||x|| < 1} is the unit n-sphere. It has volume 
We let C„(x, T) := x+ [0, T]" denote an n-cube 



of sidelength T, with sides parallel to the coordinate axes, and lowest corner at x G 



2. Local Density Inequalities 



In this section we present a general formulation of local density inequalities. 



We recall the standard definition of sphere packing densities, following Rogers |Ro|. Let O 
denote a set of unit sphere centers, so that ||v — v'|| > 2 for distinct v, v' G fi. 



Definition 2.1 (i) For a bounded region S in M", and a sphere packing + B„ specified by 
the sphere centers fJ, the density p{S) = p{Q, S) of the packing in the region S is 



p{S) := 



voi{Sn{n + Bn)) 

voliS) 



(2.1) 
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(ii) For T > the upper density p{Q,,T) is the maximum density of the packing O over all 
cubes of size T, i.e. 

p(n,r) := supp([0,r]" + x) . (2.2) 

Then the upper packing density of is 

p{9) := lim sup p{n, T) . (2.3) 

T^oo 

(iii) The sphere packing density (5(B„) of the ball B„ of unit radius is 

5(B„) :=supp(J^). (2.4) 



Definition 2.2 A sphere packing ^2 is saturated if no new sphere centers can be added to it. 

To obtain sphere packing bounds it obviously suffices to study saturated sphere packings, 
and in what follows we assume that all packings are saturated unless otherwise stated. 

Definition 2.3 An admissible partition rule is a rule assigning to each saturated packing 
in a collection of closed sets V{^) := {Ra = Ra{^)} with the following properties. 

(i) Partition. Each set Ra is a finite union of bounded convex polyhedra. The sets Ra 
cover W"' and have pairwise disjoint interiors. 

(ii) Locality. There is a positive constant C (independent of il) such that each region Ra 

has 

diameter{Ra) < C . (2.5) 
Each Ra is completely determined by the set of sphere centers w G with 

distance{w , Ra) < C . (2-6) 

There are at most C regions intersecting any cube of side 1. 

(iii) Translation-Invariance. The partition assigned to the translated packing J7' = J7 + x 
consists of the sets {Ra{^) + x}. 



Definition 2.4 An admissible weight function (or admissible score function) a for an admis- 
sible partition rule in M" assigns to each region Ra G 'P(^) and each v G Q a real weight 
a{Ra,"v) which satisfies \a{Ra,^r)\ < C* for an absolute constant C*, and which has the 
following properties. 

(i) Weighted Density Average. There are positive constants A and B (independent of il) 
such that for each set Ra, 

<^{Ra, v) = {A p{Ra) - B)vol{Ra) , (2.7) 

where 

p{Ra)vol{Ra) = vol{Ra H (O + B„)) (2.8) 
measures the volume covered in Ra by the sphere packing fl with unit spheres. 
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(ii) Locality. There is an absolute constant C (independent of 0,) such that each value 
cr(i?Q,,v) is completely determined by the set of sphere centers w G with ||w — v|| < C. 
Furthermore 

a{Ra,v) = if dist{v,Ra) > C . (2.9) 

(iii) Translation- Invariance. The weight function a' assigned to the translated packing 
r2' = + X satisfies 

o-'(i?Q + x,v + x) = cj(ii«,v) . (2.10) 
Note that this definition specifically allows negative weights. 

The "local density" is measured by the sum of the weights associated to a given vertex v 
in a saturated packing. 

Definition 2.5 (i) The vertex D-star (or decomposition star) I'(v) at a vertex v G 0, consists 
of all sets Ra G 'P(f^) such that a{Ra,^) 7^ 0. 

(ii). The total score assigned to a vertex D-star T>{v) at v G is 

Score{V{v)) := ^ a{Ra,v) . (2.11) 

The total score at v depends only on regions entirely contained within distance C of v. 
Any admissible partition and weight function (V, a) together yield a local inequality for the 
density of sphere packings, as follows. 

Theorem 2.1 Given an admissible partition in M" and weight function (V,(t), set 

e = e'p,a{A,B) := sup (sup Score{V{^r)U . (2.12) 
Q saturated Vven / 

and suppose that < KnA, where k„ is the volume of the unit n-sphere. Then the maximum 
sphere packing density satisfies 

Remark. (1) We let f{A,B,9) := ^'^^^g denote the packing density bound as a function of 
the score constants A and B. The sphere packing density bound actually depends only on the 
score constant ratio ^, rather than on B and A separately, since is a homongeneous linear 
function of A and B. This ratio detemines the relative weighting of covered and uncovered 
volume used in the inequality. 

(2) A natural approach to sphere packing bounds, used in many previous upper bounds, 
is to partition space into pieces i?(v) corresponding to each sphere center v, with each piece 
containing the unit sphere around v, and aims to establish an upper bound 

Then one obtains p{i^) < 6. An optimal sphere packing bound of this sort must necessarily 
be volume-independent, in the sense that if equality is to be attained at all local cells -R(v) 
simultaneously, then they must all have the same volume. In contrast the inequality of The- 



orem 2.1 does take into account the volumes of the individual pieces in the vertex D-star, 
and this allows more flexibility in the local density inequalities that can be constructed, which 
might attain an optimal bound. 
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Proof of Theorem 2.1. We may assume that is saturated. Given T > 0, and any e > 
we choose a point x G which attains the density bound p{0,,T) on the cube Cn(x, T) to 
within e. We evaluate the scores of ah vertex D-stars of vertices v G On C„(x, T) in two ways. 
First, by definition of 9, 

J2 Score{V{v)) <9#\nnCn{x,T)\ . (2.15) 
vennCn{x,T) 

However we also have 



^ 5core(P(v)) = ^ (^<T(i?„,v)) 

vGf^nc„(x,T) veonc„(x,T) V « / 



5] a(i?,,v) +o(r 

K.cc„(x,T) VvGf^nC„(x,T) / 

(A - B) voi{R^) + o(r 

R<,CC„{x;T) 

K„A#|J7nc„(x,r)| TOZ(c„(x,T)) + o(r 



= K„^#|finc„(x,r)| -Br" + o(T"-^) . (2.16) 

Here we use that fact that {Ra} partitions M", so covers the cube, and the 0{T^~^) error 
terms above occur because the counting is not perfect within a constant distance C of the 
boundary of the cube. Combining these evaluations yields 

{KnA - 0)#|f]nc„(x,T)| < sr" + o(T"-i.) 

If ^ < Kn^j then we can rewrite this as 



By assumption 

-p{n,T)-e < 



vol{C{x,T) n (1^ + B)) 

#|J7nc(x,r)| fi 



Together with ( 2.17| ), this yields 



with an O-symbol constant independent of e. Letting e — > and then T ^ oo gives the 
inequality for p{^l). Since this holds for all saturated packings the result follows. ■ 

Determining the quantity 6p^a{A, B) for fixed A, B can be viewed as a nonlinear optimiza- 
tion problem over a compact set. The translation-invariance property of ("P, a) allows the 
supremum ( p. 121) to be taken over the smaller set with v = and admissible VL containing 



0. The locality property shows that that the vertex D-star at is completely determined 
by w € with ||w|| < C . The set of such configurations of nearby sphere centers forms a 
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compact set in the Euclidean topology. Actually the partition and weight functions may be 
discontinuous functions of the locations of sphere centers, so the optimization problem above is 
not genuinely over a compact set. One must compactify the space of allowable vertex D-stars 
by allowing some sets of sphere centers to be assigned more than one possible vertex D-star. 
In practical cases there is a finite upper bound on the number of possibilities. 

Definition 2.6 A local density inequality in is optimal if 

f{A,B,6r,^) = 6{Bn) . (2.19) 

Optimal local density inequalities exist in one and two dimensions. In discussing the three- 
dimensional case, we shall presume that ^(63) = so that an optimal density inequality in 



will refer to one achieving this value. The evidence indicates that there are many different 
possible optimal local density inequalities in three dimensions, including that of the Hales and 
Ferguson proof. 

There are currently four candidates for local density inequalities that may be optimal in 
three dimensions. The first is that of L. Fejes-Toth, described in §3, which uses averages over 
Voronoi domains, in which the score constant ratio 4 = -S=. The second is that of Hsiang 
|Hs|, which is a modification of the Fejes-Toth averaging, and uses the same score constant 



ratio. The third is due to Hales | SP-1 |, and is based on the Delaunay triangulation, using a 



modified scoring rule described in §3. The fourth is that given in Ferguson and Hales |FH|, 
and uses a combination of Voronoi-type domains and Delaunay simplices, with a complicated 
scoring rule, described in §4. In the latter two cases the score constant ratio is 

R —Sir + 12 arccos(4=) 

- = 5oct = 7= — - 0.720903. 

A 

In all four cases the compact set of local configurations to be searched has very high dimension. 
Each sphere center has three degrees of freedom, and the number of sphere centers involved 
in these methods to determine a vertex D-star seems to be around 50, so the search space 
consists of components of dimension up to roughly 150. 

It is unknown whether optimal local density inequalities exist for the sphere packing prob- 
lem in in any dimension n > 4. In dimensions 4, 8 and 24 it seems plausible that the 
minimal volume Voronoi cell in any sphere packing actually occurs in the densest lattice pack- 
ing. If so, the Voronoi cell decomposition would yield an optimal local inequality in these 
dimensions, and the densest packing would be a lattice packing in these dimensions. Another 
question asks: in which dimensions is the maximal sphere packing density attained by a sphere 
packing whose centers form a finite number of cosets of an n-dimensional lattice? Perhaps 
in such dimensions an optimal local density inequality exists. The state of the art in sphere 



packings in dimensions four and above is given in Conway and Sloane |CS] 



3. History 

We survey results on local density inequalities in three dimensions. The work on local 
density bounds was originally based on two partitions of associated to a set J7 of sphere 
centers: the Voronoi tesselation and the Delaunay triangulation. Since they will play an 
important role, we recall their definitions. 
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Definition 3.1 The Voronoi domain (or Voronoi cell) of v G J7 is 

Vyori^) = K,or-(v, := {x G : ||x - v|| < ||x - w|| for all w € Q} . (3.1) 

The Voronoi tesselation for 17 is the set of Voronoi domains {K)or(v) : v G J^}. 

The Voronoi tesselation is a partition of space, up to boundaries of measure zero. If is a 
saturated sphere packing, then all Voronoi domains are compact sets with diameter bounded 
by 4^/2. 

Definition 3.2 The Delaunay triangulation associated to a set Q is dual to the Voronoi tesse- 
lation. It contains an edge between every pair of vertices that have Voronoi domains that share 
a common face. Suppose now that the points of 0, are in general position, which means that 
each corner of a Voronoi domain has exactly four incident Voronoi domains. In this case these 
Voronoi domains have between them four faces that touch this corner, and these faces in turn 
determine (four edges of) a Delaunay simplex. The resulting Delaunay simplices partition 
and make up the Delaunay triangulation. In the case of non-general position Q the Delaunay 
triangulation is not unique. The possible Delaunay triangulations are determined locally as 
limiting cases of general position points. (There are only finitely many triangulations possible 
in any bounded region of space.) 

All the simplices in a Delaunay triangulation have vertices Vj G and contain no other 
point V G 0. We define, more generally: 

Definition 3.3 A D-simplex (or weak Delaunay- simplex) for 17 is any tetrahedron T with 
vertices vi,V2,V3,V4 G 17 such that no other v G 17 is in the closure of T. We denote it 

i^(vi, V2, V3, V4). 

In the literature a Delaunay simplex associated to a point set 17 is any simplex with vertices 
in 17 whose circumscribing sphere contains no other vertex of 17 in its interior. All simplices in 
a Delaunay triangulation of 17 are Delaunay simplices, so are necessarily Z7-simplices, but the 
converse need not hold. 

The admissible partitions that have been seriously studied all consist of a domain ^(v) 
associated to each vertex v G y, which we call a V-cell, together with a collection of certain 
D- simplices ^(vi, V2, V3, V4) which we call the D-system of the partition. We use the term 
D-set to refer to a Z7-simplex included in the L)-system. We note that a F-cell may consist of 
several polyhedral pieces, and may even be disconnected. 

The original approach of L. Fejes-Toth to getting local upper bounds for sphere packing 
in used the Voronoi tesselation associated to 17. If 17 is a saturated packing, then each 
Voronoi domain K)or(v) is a bounded polyhedron consisting of points within distance at most 
4\/2 of V. Examples are known of Voronoi domains in a saturated packing that have 44 faces; 
an upper bound for the number of faces of a Voronoi domain of a saturated packing is 49. The 
Voronoi partition takes the l^-sets V{v) to be the Voronoi domains of v, with no -D-sets, and 
the vertex Z7-star Pt,or(v) is just 14or(v). A Voronoi scoring rule is: 

Score(P,o,(v)) := {p{V{v)) - B) vol{V{w)), . (3.2) 

with the score constants A = 1 and B is to be chosen optimally. Such scoring rules are 
admissible. However it has long been known that no Voronoi scoring rule gives an optimal 
inequality ( 2.19| ). The dodecahedral conjecture states that the maximum packing density of 



a Voronoi domain is attained for a local configuration of 12 spheres touching at the center of 
faces of a circumscribed regular dodecahedron. 
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Dodecahedral Conjecture. For a Voronoi domain K)or(v) of a unit sphere packing, 



p(Kor(v)) < 



15(1 - COS f tan f) 



0.754697 



(3.3) 



and equality is attained for the dodecahedral configuration. 

A proof of the dodecahedral conjecture has been announced by Hales and McLaughlin 
I Pod I, based on similar ideas to the Hales' approach to the Kepler conjecture. 

In 1953 L. Fejes-Toth ]FT] ] proposed that an optimal inequality might exist based on a 
weighted averaging over Voronoi domains near a given sphere center, and in 1964 he made 
a specific proposal for such an optimal inequality. In the notation of this paper he used the 
Voronoi partition and an admissible scoring function of the form 



cr(14or(w), v) := u;(w, v){(ylp(K,or(w)) - B)vol (T4or(w))}, 
in which A = 1 and B = and the weights uj{\v, v) are given by 



(3.4) 



w w, V 



^ if 2 < ||w- v|| < 2 + t , 



if llw- vll > 2 + t 



(3.5) 



and 



ti;(v,v) := 1 



;(w, v) 



(3.6) 



Here t > is a fixed constant. The resulting inequality in Theorem is optimal if the 
associated constant Op^^^^A, B) = 0. The Fejes-Toth scoring function corresponds to a weighted 
averaging over the spheres touching a central sphere at v, where the weight assigned the central 
sphere depends on how many spheres touch it. L. Fejes-Toth considered choosing t as large as 
possible consistent with requiring that a;(v, v) > 0, which is equivalent to requiring that it is 
impossible to pack 13 spheres around a given sphere, with all 13 sphere centers within distance 
2 -|- t of the center of the given sphere. In 1964 he suggested pTl , p. 299] that one could 
take the value t = 0.0534, and we consider this to be Fejes-Toth's candidate for an optimal 
inequality. It has not been demonstrated that u;(v, v) > holds for this value of t, but note 



that the argument of Theorem 2.1 is valid for any value of t, even if some w(v, v) are negative. 



The only issue is whether the resulting sphere packing bound is optimal. Fejes-Toth [FT1| 
explicitly noted that establishing an optimal inequality, if it is true, reduces the problem in 
principle to one in a finite number of variables, possibly amenable to solution by computer. 

In 1993 Wu-Yi Hsiang |Hs| studied a variant of the Fejes-Toth approach. He used the 
Voronoi partition and an admissible scoring function |^ of the form ( |3.4| ) , with the same A = 1 
and B = and with the weights a;(w, v) > given by 



w w, V 



1+Af(v) 





if 
if 



w 



w 



< 



> 



where 



218 
100 ' 

218 
100 ' 

218 



iV(v) := # <j w G : < ||w - v|| < — 



(3.7) 



(3.8) 



^We have converted the locaUy averaged density in Hsiang [|B[^, Section 3] to the form given in §2 by clearing 
denominators, and we have cancelled out Hsiang's factor of 13. Note also that each Voronoi domain contains 
exactly one sphere, so that p{V {'v))vol{V {w)) = 
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counts the number of "near neighbors" of v. Hsiang announced that his local inequality is 
optimal (with 6-p^i^{A, B) = 0), and that he had proved it, which would then constitute a 
proof of Kepler's conjecture. However his proof of optimality is regarded as incomplete by 
the mathematical community, see G. Fejes-Toth's review of Hsiang's paper in Mathematical 



Reviews, the critique in |Ha94], and Hsiang's rejoinder [Hs2|. 

In 1992 Hales ( pil| , |H2[ |) studied the Delaunay triangulation, which partitions M.^ into D- 
sets. There are a finite number of (local) choices for Delaunay triangulations of a neighborhood 
of a fixed v G fi. Hales used the following function in defining his associated weight function. 

Definition 3.4 The compression of a finite region R in with respect to a sphere packing 
O is 

T{R) := {p{R) - 5oct)vol{R) (3.9) 

in which 

— Stt + 12arccos(4=) 

5oct ■■= 7= — ~ 0.720903 (3.10) 

v8 

is the packing density of the regular octahedron of sidelength 2 with unit spheres centered at 
its vertices. 

Hales initially considered the admissible weight function 

fT(D(vi, V2, V3, V4), Vj) := r(L»(vi, V2, V3, V4)). (3.11) 

The vertex D-star of v consists of all the simplices in the Delaunay triangulation that have v 
as a vertex; we call this set of simplices the Delaunay D-star 2?z)e«(v) at v. Hales used score 
constants A = A and B = ASoct^ with A = 4 used since each simplex is counted four times. 
However he discovered that the pentagonal prism attained a score value exceeding what is 
needed to prove Kepler's conjecture. The pentagonal prism is conjectured to be extremal for 
this score function. 

The fact that the (conjectured) extremal configurations for the Voronoi tesselation and 
Delaunay triangulation do not coincide suggested to Hales that a hybrid scoring rule be con- 
sidered that combines the best features of the Voronoi and Delaunay scoring function. In 1997 
Hales again considered a Delaunay triangulation, but modified the scoring rule to depend on 
the shape of the D-simplex D(vi, V2, V3, V4). For some simplices he used the weight function 
above, while for others he cut the simplex into four pieces, one for each vertex, call the pieces 
V{D,^ri), and assigned the weights^ 

a(L>(vi, V2, V3, V4), Vi) = 4T{ViD, Vi)), 

for 1 < i < 4. He also partitioned a vertex D-star into pieces called "clusters" whose score 
functions could be evaluated separately and added up to get the total score. Each "cluster" 
is a finite union of Delaunay simplices filling up that part of the vertex D-star at v lying in a 
pointed cone with vertex v. This vertex cone subdivision facilitates computer-aided proofs by 
decomposing the problem into smaller subproblems. Hales ( fSP-l ], [ SP-H ]) presented evidence 



that this modified scoring function satisfies an optimal local inequality. He showed that the 



More precisely, he used the "analytic continuation" of this scoring function that is described in Appendix 

A. 
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two known local extremal configurations |^ gave local maxima of the score of the Delaunay 
D-star in the configuration space with 

Score{VDeiM) = 8pt , (3.12) 

where 

llvr / 1 \ , , 

pt:= — - 12arccos (^-^J ~ 0.0553736 , (3.13) 

which is the optimal value. He also showed that this was a global upper bound over the subset 
of configurations described by a vertex map^ ^(v) that is triangulated. Hales [ pP-I , Conjecture 



2.2] conjectured that the modified score function achieved the optimal inequality 

Score{VDei{^)) < 8pt , (3.14) 



for all Delaunay D-stars I'z)ez(v). However he and his student S. P. Ferguson [|FH[| discovered 
that a pentagonal prism configuration comes very close to violating the inequality ( 3.14] ). 



Furthermore there turned out to be many similar difficult configurations which might possibly 
violate the inequality. These and other difficulties indicated that it was not numerically feasible 
to prove ( 3.14[ ) by a computer proof, assuming that ( 3.14| ) is actually true. 



Hales and Ferguson together then further modified both the partition rule V and the scoring 
rule a, to obtain a rule with the following properties. 

(i) It makes the score inequality stronger on the known bad cases related to the pentagonal 
prism configuration. 

(ii) It uses a more complicated notion of "cluster", which includes Voronoi pieces as well as 
D-sets, and which retains the "decoupling" property that it is completely determined by 
vertices of in the cone above it. 

(iii) It chooses a scoring function which when combined with "truncation' on clusters is still 
strong enough to rule out most configurations. The "truncation" operation greatly re- 
duces the number of configurations to be checked, at the cost of weakening the inequality 
to be proved. 

In §4 we give a precise description of the Hales- Ferguson rules {V,a). 
4. Hales-Ferguson Partition and Score Function 



Ferguson and Hales [[FHl use the following partition and scoring rule. The partition uses 
two types of D-simplices, with a complicated rule for picking which ones to include as L'-sets 
in the partition. Modified Voronoi domains V{v) are used as y-sets. These differ from the 
usual Voronoi domain (with the D-sets removed) by mutually exchanging some regions called 
"tips". The scoring rule is also complicated: the weight function used on a D-simplex no longer 
depends on just its shape, but depends on the structure of nearby D-sets. 

We begin by defining the two types of D-simplices. 

Definition 4.1 A QR-tetrahedron (or quasi-regular tetrahedron) is any tetrahedron with all 
vertices in Q and all edges of length < |gi . 

^These c orres pond to Voronoi cells being the rhombic dodecahedron or trapezo-rhombic dodecahedron in 



Fejes-Toth ]FTl| , p. 295]. 

*See §5 for a definition of vertex map t/(v) 
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Definition 4.2 A QL-tetrahedron (or quarter) is any tetrahedron with all vertices in 0, and 
five edges of length < ^ and one edge with length |gg < I < 2\/2. The long edge is called 
the spine (or diagonal) of the QL-tetrahedron. 

For some purposes^ the case of a spine of length exactly should be considered as either 
a QL-tetrahedron and QR-tetrahedron. Here we treat it exclusively as a QR-tetrahedron. 

Neither kind of tetrahedron is guaranteed to be included in the Delaunay triangulation of 
O, but we do have: 

Lemma 4.1 All QR-tetrahedra and QL-tetrahedra are D-simplices. 

The Hales-Ferguson partition rule starts by selecting which D-simplices to include in the 
D-system. These consist of: 

(i) All QR-tetrahedra. 

(ii) Some QL-tetrahedra. The QL-tetrahedra included in the partition satisfy the common 
spine condition which states that for a given spine, either all QL-tetrahedra having that 
spine are included, or none are. 

This collection of tetrahedra must (by definition of admissible partition) form a nonoverlapping 
set, where we say that two sets 5i and 5*2 overlap if 5*1 n 5*2 has positive Lebesgue measure in 
M^. To justify (i) we have: 

Lemma 4.2 No two QR-tetrahedra overlap. 

QL-tetrahedra may overlap Qi?-tetrahedra or other QL-tetrahedra, hence one needs a rule 
for deciding which QL-tetrahedra to include. To begin with, QL-tetrahedra can overlap QR- 
tetrahedra in essentially one way. 

Lemma 4.3 If a QL-tetrahedron and QR-tetrahedron overlap, then the QR-tetrahedron has 
a common face with an adjacent QR-tetrahedron, and the two unshared vertices of these QR- 
tetrahedra are the endpoints of the spine of the QL-tetrahedron. The union of these two QR- 
tetrahedra can be partitioned into three QL-tetrahedra having the given spine, which includes 
the given QL-tetrahedron. Aside from these QL-tetrahedra, no other QL-tetrahedron overlaps 
either of these two QR-tetrahedra. 

This lemma shows that the QL-tetrahedra having a given spine have the property that 
either all of them or none of them overlap the set of Qi?-tetrahedra. We next consider how QL- 
tetrahedra can overlap other QL-tetrahedra. The following configuration plays an important 
role. 

Definition 4.3 A Q-octahedron is an octahedron whose 6 vertices Vj G $7 and whose 12 edges 
each have lengths 2 < I < jg^. 

A Q-octahedron has three interior diagonals. If a diagonal has length 2 < / < then it 

partitions the Q-octahedron into four Qi?-tetrahedra. If a diagonal has length |gg < / < 2^/2 

then it partitions the Q-octahedron into four QL-tetrahedra of which is the common spine. 

If a diagonal has length / > 2\/2 it yields no partition. A Q-octahedron thus gives between 

zero and three different partitions into four Qi2-tetrahedra or QL-octahedra. We call it a live 

Q-octahedron if it has at least one such partition. Lemma |4.3| implies that if it has a partition 

into Qi?-octahedra then it has no other partition into Qi?-octahedra or QL-octahedra. 

■''In proving inequalities one wants to work on a compact set. In compactifying the space of configurations, 
this requires allowing the lower inequality in the definition of QL-tetrahedron to be an equality. 
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Lemma 4.4 A QL -tetrahedron having spine a diagonal of a Q-octahedron does not overlap 
any QL -tetrahedron whose spine is not a diagonal of the same Q-octahedron. 



The rules for choosing which QL-tetrahedra to include either take all QL-tetrahedra having 
the same spine or take none of them. Thus the selection rule really specifies which spines to 
include. 

Definition 4.4 Consider an edge [vi, V2] with vi, V2 G Vl and < ||vi — V2II < 2\/2. A 
vertex w G is called an anchor of the edge [vi, V2] if 

„ 251 , 
w — V,' < for i = 1,2 . 

II III - -^QQ 

Ferguson and Hales use the number of Q-L-tetrahedra having a spine [vi , V2] and the number 
of anchors of that spine in deciding which QL-tetrahedra to include in the D-system. Call a 
QL-tetrahedron isolated if it is the only Q-L-tetrahedron on its spine [vi,V2]. The inclusion 
rule for an isolated Q-L-tetrahedron is: 

(QLO) An isolated Q-^^-tetrahedron is included in the D-system if and only if it overlaps^ no 
other QL-tetrahedron or QiJ-tetrahedron. 

Next consider spines [vi,V2] which have two or more associated QL-tetrahedra. Such spines 
have at least three anchors, and the inclusion rules are: 

(QLl) Each non-isolated QL-tetrahedron on a spine with 5 or more anchors is included in the 
-D-system. 

(QL2) Each non-isolated QL-tetrahedron on a spine with 4 anchors is included in the D-system, 
if the spine is not a diagonal of some Q-octahedron. In the case of a live Q-octahedron, we 
include all QL-tetrahedra having one particular diagonal, and exclude all QL-tetrahedra 
on other diagonals. For definiteness, we choose the spine to be the shortest diagonal. In 
case of a tie for shortest diagonal, a suitable tie-breaking rule is used. 

(QL3) Each non-isolated QL-tetrahedron on a spine with 3 anchors is included in the D-system 
if each QL-tetrahedron on the spine does not overlap any other QL-tetrahedron or QR- 
tetrahedron, or overlaps only isolated QL-tetrahedra. It is excluded from the D-system 
if some tetrahedron on the spine overlaps either a Qi?-tetrahedron or a non-isolated 
QL-tetrahedron having four or more anchors. Finally, if some tetrahedron on the spine 
overlaps a nonisolated QL-tetrahedron having exactly three anchors, then the spine of the 
overlapped set is unique, and exactly one of these two sets of non-isolated QL-tetrahedra 
with 3 anchors is to be included in the D-system, according to a tie-breaking ruleQ. 

The set of QL-tetrahedra selected above are pairwise disjoint and are disjoint from all QR- 
tetrahedra. This is justified by the following lemma. 

Lemma 4.5 (i) If two QL-tetrahedra overlap, then at most one of them has 5 or more anchors. 

(a) If two overlapping QL-tetrahedra each have 4 anchors, then their spines are (distinct) 
diagonals of some Q-octahedron. 



It cannot overlap a Qf?-tetrahedron by Lemma 4.3 

7- 



The tie-breaking rule could be to include the spine with lowest endpoint using a lexicographic ordering 
of point s in R^. It appears to me that Hales would permit an arbitrary choice of which one to include, see 



Lemma 4.5 (iii 
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(in) If a nonisolated QL -tetrahedron with 3 anchors overlaps another QL -tetrahedron hav- 
ing three anchors, then each of their spines contains exactly two nonisolated QL-tetrahedra, 
and these four QL-tetrahedra overlap no other QL -tetrahedron or QR-tetrahedron. 

We call the set of QiJ-tetrahedra and Q-L-tetrahedra selected as above the Hales-Ferguson 
D-system. (Hales and Ferguson call this a Q-system.) 

We now define the V-cells of the Hales-Ferguson partition. To begin with, we take the 
Voronoi domain K)or(v) at vertex v and remove from it all D-simplices in the D-system to 
obtain a reduced Voronoi region Vred{^)- Next we move certain regions of Vred{^) called "tips" 
to neighboring reduced Voronoi regions to obtain modified regions Knod(v) and finally we 
define the T^-cell V{^) at v to be the closure of Vmod{^)- 

Definition 4.5 Let T be any tetrahedron such that the center x = x(T) of its circumscribing 
sphere lies outside T. A vertex v of T is negative if the plane H determined by the face F of 
T opposite V separates v from x. The "iip" A(r, v) of T associated to a negative vertex v 
is that part of the Voronoi region of v with respect to the points {vi, V2, V3, V4} that lies in 
the closed half plane determined by H that contains x. The "tip" region A(r, v) does not 
overlap T, and is a tetrahedron having x as a vertex, and has three other vertices lying on H. 



Lemma 4.6 (i) A QR-tetrahedron or QL -tetrahedron T has at most one negative vertex. 

(a) If a negative vertex is present, then the three vertices of the associated "tip" that lie on 
H actually lie in the face of T opposite to the negative vertex. 

(Hi) The "tip" of any tetrahedron in the Hales-Ferguson D-system either does not overlap 
any D -simplex in the Hales-Ferguson D-system, or else is entirely contained in the union of 
the D-simplices in the D-system. 

We say that a "tip" that does not overlap any Z?-set is uncovered. The lemma shows 
that uncovered "tips" lie in the union of the Voronoi regions {K'ed(w) : w G il}, so that 
rearrangement of uncovered "tips" is legal. There is an a priori possibility that two "tips" may 
overlap]^ each other. 

Uncovered Tip Rearrangement Rule. Each y G that belongs to an uncovered "tip" 
is reassigned to the nearest vertex w G Q such that y is not in an uncovered "tip" of any 
pair (r, w) where T is in the Hales-Ferguson D-system and w is a negative vertex of v. (A 
tiebreaking rule is used if two nearest vertices w are equidistant.) 

This rule cuts an uncovered "tip" into a finite number of polyhedral pieces and reassigns 
the pieces to different reduced Voronoi regions. This prescribes how Vmod(y) is constructed, 
and thus defines the Hales-Ferguson V-cells V{v). 

A two-dimensional analogue of a "tip" is pictured^ in Figure 1. In this figure the triangle 
T = [vi,V2,V3] plays the role of a D-simplex, with V2 as a negative vertex and the "tip" is 
the shaded region. The points C012, cqis, C123 are centroids of the triangles determined by the 
corresponding Vj's. The shaded triangle [0012)^013,0123] is in the Voronoi cell V^or(vo) while 
the remainder of the "tip" is in the Voronoi cell Vuor(v2)- The uncovered tip rearrangement 
rule partitions the part in 14'or(v2) into three triangles which are reassigned to the y-cells 
V{vq),V(vi) and V(v3), e.g. [yi,y2,coi2] is reassigned to V{vi). The reassignment of the 

^ I don't know if this pos sibility can occur. 

^ See Figure 2.1 of Hales [SP-II for another example. 
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Figure 1: "Tip" of D-simplex [vi,V2,V3] 

"tip" ensures that the pointed cone over V2 generated by the D-simpIex [vi,V2,V3] does not 
contain any part of the V-cell at V2. In this example the y-cell at vq does not feel the effect 
of the vertex V2, due to the rearrangement. 

We now turn to the Hales-Ferguson scoring rules. These use the compression function T{S) 
given in ( |3.9D . The compression function is additive: If S" = S*! U 5*2 is a partition, then 



r{S) = T{Si)+T{S2) . (4.1) 

For a L)-simplex T, 

,oi(r)p(r) = X: "°'"r"''' . ("-2) 

i=l 



where a full solid angle is 47r. 

The Hales-Ferguson weight function for a V-cell is as follows. 

(SI) For a V-cell V{v), 

' 4r(y(v)) if V = w , 



aHF{V iv),w) 



(4.3) 



if V 7^ w . 
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We next consider the weight function for D-sets. Let (T, v) denote a Z)-simplex together 
with a vertex v of it. 

Definition 4.6 The Voronoi measure vor{T, v) is defined as follows. If the center of the 
circumscribing sphere of T lies inside T, then T is partitioned into four pieces 

y+r(^,Vi) :={xGT: ||x-v,|| < ||x-vj-|| for 1 < j < 4} 

and then 

t;or(r,v):=r(y+.(T,v)) . (4.4) 

There is an analytic formula for the right side of ([4. 4)) given in Appendix A, and this formula 
is used to define vor{T, v) in cases where the circumcenter falls outside T. 

In cases where the circumcenter is outside v, and v is a negative vertex, then 

vor{T,v) = r(Kor(r, v) U "tip") (4.5) 

while for the other three vertices parts of the "tip" are counted with a negative weight, in such 
a way that 

4 

J2vor{T,^^,) = AT{T) (4.6) 

1=1 

holds in all cases. The weight function for a D-set is given as follows: 
(S2) For a Qi^-tetrahedron T = £'(vi, V2, V3, V4) in the D-system, 

r(T) if the circumradius of T is at most jgg. 



(4.7) 



vor{T,v) if the circumradius of T exceeds jq^. 



The QL-tetrahedron scoring function is complicated. For a QL-tetrahedron T. let rj^{T) be 
the maximum of the circumradii of the two triangular faces of T adjacent to the spine of T, 
and define the function 

r r(T) if rj+{T) < V2 
^i{T,^r) := }. (4.8) 
[ vor{T,v) if r/+(T) > V2 

Then the QL-tetrahedron scoring function is defined by: 

(53) ( "Flat quarter" case) For a Q-^-tetrahedron T and a vertex v not on its spine, 

aHF{T,v) := ^iiT,v) . (4.9) 

(54) ("Upright quarter case") For a QL-tetrahedron T with vertex v on its spine, let v denote 
the opposite vertex on the spine. If T is an isolated QL-tetrahedron, set 

aHF(T,v) :=/z(r,v) . (4.10) 

If T is part of a Q-octahedron, set 

aHF{T,^r) := ^(/^(r, v) + ^(T, v)) . (4.11) 
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In all other cases, set 

aHF(T,v) :=^(/x(r,v)+/i(r,v)) + ^(i;oro(r,v)-z;oro(r,v)) , (4.12) 

in which foro(T, v) is a "truncated Voronoi measure" that only counts volume within 
radius ^(fgg) of vertex v, which is defined in Appendix A, and in [FH, pp. 9-11]. 



The scoring rule (S4) is the most complicated one. In it the definition (4.11) plays an important 



role in obtaining good bounds for the pentagonal prism case treated in Ferguson | SP-V ], while 



the definition ( 4.12| ) is important in analyzing general configurations using truncation in Hales 



|SP-IV|. 



Theorem 4.1 The Hales-Ferguson partition and scoring function {Vhf^o'hf) (^re admissible, 
with score constants A = A and B = Adoct- 

Proof. It is easy to verify that the definitions for scoring Q-R-tetrahedra and Q-^^-tetrahedra 
satisfy the weighted density average property 

4 

^aHF(T,Vi) = 4r(r) , (4.13) 
1=1 

which correspond to A = 4 and B = 4(5oct, using (|4.6|). Most of the remaining admissibility 



conditions are verified by Lemmas |4.lH4.6| except for locality. For locality, a conservative 
estimate indicates that the rules for removing and adding "tips" to determine the y-cell V{'v) 
are determined by sphere centers w G with ||w — v|| < 12\/2. Finally the score function on 
the D-simplices is determined by vertices within distance of v. ■ 

Theorem ^?l] associates to {Vhf, o'hf) a sphere-packing bound that the Hales program 
asserts is optimal. To establish the Kepler bound 

m < 4^ , (4.14) 

V io 



via ( 2.15 ), one must prove that 

e := ev„p,a„A^A^oct) = spt , (4.15) 

where 

pt:=^- 12arccos ( ^ ) ^ 0.0553736 . (4.16) 



3 VV3. 

The score function Score{'DHF{^)) is discontinuous as a function of the sphere centers in 
Q near v, because it is a sum of contributions of pieces which may appear and disappear as 
sphere centers move, and discontinuities occur when QL-tetrahedra convert to Q-R-tetrahedra. 
To deal with this, one compactifies the configuration space by allowing some sphere center 
configurations to have more than one legal decomposition into pieces (but at most finitely 
many). The optimization problem can then be split into a finite number of subproblems on 
each of which cthf is continuous. 

The complexity of the definition of {Vhf, o'hf) is designed to yield a computationally 
tractable nonlinear optimization problem. The introduction of QL-tetrahedra and the com- 
plicated score function on them is designed to help get good bounds for the pentagonal prism 
case and similar cases. The rule for moving "tips" is intended to facilitate decomposition of 



the nonlinear optimization problem into more tractable pieces via Theorem |5.5| below, and the 
use of "truncation." 
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5. Kepler Conjecture 



The main result to be established by the Hales program is the following. 

Theorem 5.1 (Main Theorem) For the Hales-Ferguson partition and scoring rule {VhFj (^hf) 
and any v £ in a saturated sphere-packing, the vertex D-star Dnpi'^) o-t v satisfies 

Score{VHF{-v)) < 8pt , (5.1) 

where pt := ^ - 12arccos (-^] ~ 0.0553736. 



The Kepler conjecture follows by Theorem 2.1. 



To prove the inequality ( |5.1D , by translation-invariance we can reduce to the case v = 
and search the set of all possible vertex stars, which by §4 are determined by those points 
w G with ||w|| < 12\/2. From now on we assume € O and v = 0. 

The space of possible sphere centers {w G ft : ||w|| < 12\/2} is compact. It can be 
decomposed into a large number of pieces, on each of which the score function is continuous. 
To obtain compact pieces, we must compactify the configuration space by assigning more than 
one possible local D-star T>{v) to certain arrangements of sphere centers. The compactification 
assigns at most finitely many to each arrangement with an absolute upper bound on the number 
of possibilities. 

The definition of the score Score{T>{v)) involves a sum over the l^-sets and D-sets. The 
usefulness of the compression measure T{S) is justified by the following lemma. 

Lemma 5.1 (i) Every Q R-tetrahedron T satisfies 

r{T) < pt , (5.2) 

with equality occurring only when T is a regular tetrahedron of edge length 2. 
(ii) A QL -tetrahedron T has 

r(T) < , (5.3) 
with equality occurring for those T having five edges of length 2 and a spine of length 2^/2. 

Result (ii) illustrates a somewhat counterintuitive behavior of the local density function: 
when holding five edges of a tetrahedron fixed of length 2, and allowing the sixth edge to vary 
over < / < 2^/2, the local density measure is largest for a spine of maximal length. 

The vertices w G with ||w|| < play a particularly important role, for they determine 
all Qi^-simplices of Vt containing as a vertex. 

Definition 5.1 The planar map (or graph) ^(v) associated to a vertex v G consists of the 
radial projection onto the unit sphere dB{v, 1) = {x G : ||x — v|| = 1} centered at v of 
all vertices w G $7 with ||w — v|| < plus all those edges [w, w'] between two such vertices 
which have length ||w — w'|| < 



100 

251 
100- 



Here we regard the planar map ^(v) as being given with its embedding as a set of arcs on 
the sphere. The following lemma asserts that no new vertices are introduced other than those 
coming from points of Q with ||w|| < 

Lemma 5.2 The radial projection of two edges [wi,W2], [w']^,W2] as above onto the unit sphere 
dB{v,l) give two arcs in G{v) which either are disjoint or which intersect at an endpoint of 
both arcs. 
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We study local configurations classified by the planar map ^(0). The planar map ^(0), 
which is determined by the vertices ||w|| < does not in general uniquely determine the 
vertex D-star Pj^f(O), but does determine all points x in it with ||x|| < 

Definition 5.2 The part of 2?//i7(v) that lies in the pointed cone with base point v determined 
by a face of the map ^(v) is called the cluster over that face. Note that the face need not be 
convex, or even simply connected- it could be topologically an annulus, for example. 

The following lemma shows that the vertex D-star DnpiO) can be cut up into clusters in 
a way compatible with the scoring function. 

Lemma 5.3 Each QR-tetrahedron or QL -tetrahedron in the D-star T)hf{^) is contained in 
a single cluster. Furthermore all such tetrahedra having a common spine are contained in a 
single cluster. 

In effect the partition of the vertex D-star into clusters partitions the V-cell into smaller 
pieces, while leaving the D-sets unaffected. The scoring function is additive over any partition 



of a V^-cell into smaller pieces, according to ( |4.1[ ) and (O). The score anriF) of the cluster 
determined by a face -F of ^ ( v) is the sum of the scores of the Q-R-tetrahedra and Q-^^-tetrahedra 
in the cluster, plus the Voronoi score 4T{R) of the remaining part R of the cluster. We then 
have 

Score{VHFM)= Yl '^hf{F). (5.4) 
Feg(v) 

We now consider clusters associated to the simplest faces F in the graph ^(v). Each trian- 
gular face corresonds to a Q-R-tetrahedron in Vhf{"^), and, conversely, each Q-R-tetrahedron 
in T)hf{^) produces a triangular face. A quad cluster is a cluster over a quadrateral face. A 
Q-octahedron with spine ending at results in a quadrilateral face, but there are many other 
kinds of quad clusters. In the case of faces F with > 5 edges, the cluster may consist of a 
y-cell plus some QL-tetrahedra, in many possible ways. All the possible decompositions into 
such pieces have to be considered as separate configurations. 

Lemma 5.4 (i) A cluster over a triangular face F consists of a single QR-tetrahedron, and 
conversely. The score of such a cluster is at most 1 pt, and equality holds if and only if it is a 
regular tetrahedron of edge length 2. 

(a) The sum of the score functions over any quad cluster is at most zero. Equality can 
occur only if the four sphere centers Vj corresponding to the vertices of the quad cluster each 
lie at distance 2 from v and also from each other, if they share an edge of the quad cluster. 

(Hi) The score of a cluster over any face with five or more sides is strictly negative. 



The extremal graphs where equality is known to occur in ( |5.1[ ) have eight triangular faces 
and six quadrilateral faces. The upper bound of 8pt for these cases is implied by this lemma. 
(It appeared first in Hales |SP-II, Theorem 4.1].) 



The following result rules out graphs ^(v) with faces of high degree. 
Theorem 5.2 AH decomposition stars T>HFiO) with planar maps 0(0) satisfy 

Score{VHF{0)) < 8pt (5.5) 

unless the planar map G{0) consists entirely of (not necessarily convex) faces of the following 
kinds: polygons having at most 8 sides, in which pentagons and hexagons may contain an 
isolated interior vertex or a single edge from an interior vertex to an outside vertex, and a 
pentagon may exclude from its interior a triangle with two interior vertices. 
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There remain a finite set of possible map structures tliat satisfy the conditions of Theorem 
5^^. Here we use the fact that there can be at most 50 vertices v with ||v|| < The hst 



is further pruned by various methods, and reduced to about 5000 cases. Since the (putative) 



extremal cases are already covered by Lemma 5.4, in the remaining cases one wishes to prove 



a strict inequality in and such bounds can be obtained in principle by computer. 

Most of the remaining cases are eliminated by linear programming bounds. The linear 
programs involve obtain upper bounds for the score function Score{T>HF{0)) for a planar map 
^ of a particular configuration type, using as objective function the score function, in the form: 

Maximize ScoreiVHpiO)) ■= <7(F) , (5-6) 

faces 
F 

where the variable (j{F) is the sum of weights associated to the cluster over the face F. The 
use of linear programming relaxations of the nonlinear program seems to be a necessity in 
bounding the score function. For example the compression function T(R) for different regions 
R is badly behaved: it is neither convex nor concave in general. The linear constraints include 
hyperplanes bounding the convex hull of the score function over the variable space. 

One can decouple the contributions of the separate faces F of ^(v) using the following 
result. 

Lemma 5.5 (Decoupling Lemma) Let v €z Q be a vertex of a saturated packing and let F be 
a face of the associated planar map ^(v), and let Cp denote the (closed) pointed cone over F 
with vertex v, and let Cp^red denote the closure of the cone over F obtained by removing from 
Cp all cones over D-sets with a corner at v. Then the portion of the V-cell V{v) that lies in 
Cp is completely determined by the vertices of il. that fall in the smallest closed convex cone 
Cp containing Cp. In particular, 

Vf := v{v) nCp = v{n n Cp, v) n Cp^red ■ (5.7) 

To obtain such a decoupling lemma requires the exchange of "tips" between Voronoi do- 
mains, as described in §4. 

The decoupling lemma permits the score function a{V{v),v) to be decomposed into poly- 
hedral pieces that depend on only a few of the nearby vertices. This decomposes the problem 
into a sum of smaller problems, to bound the scores of the pieces a{V{v) n Cp,v) in terms of 
these vertices. It will often be applied when the face F is convex, in which case Cp = Cp. 

A futher very important relaxation of the linear programs involves "truncation." The 
truncated V-cell is 

251 

V^™„c(v):=y(v)nB(v:— ). (5.8) 
We may consider truncation of that part of the V — cell over each face of Q separately. 

Lemma 5.6 Let F be a face ofQ{v) andCp the cone over that face. The region Vtrunc{'v)f^Cp 
is entirely determined by the vertices ofG{v) in Cp. IfVp denotes this set of vertices, together 
withv then this region is the closure of {VyorO^,^)C^Cp) — {D—sets}. The compression function 
satisfies the bound 

r{VtruncM n Cp) > r(y(v) n Cp). (5.9) 



The inequality ( p.9\ ) implies that replacing a Voronoi-type region by a truncated region 
can only increase the score, hence one can relax the linear program by using the score of 
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truncated regions. If one is lucky the linear programming bounds using truncated regions will 
still be strong enough to give the desired inequality. The use of truncation greatly reduces 
the number of configurations that must be examined. Truncation bounds were also used in 



proving Theorem 5.2 above 



We add the following remarks about the construction of the linear programming problems. 

(1) For each face F of a given graph type Q Hales and Ferguson construct a large number 
of linear programming constraints in terms of the edge lengths, dihedral angles and solid 
angles of the polyhedral pieces making up the cluster of Vhf{"^) over face F of the graph 
Q. The edge lengths, dihedral angles and solid angles are variables in the linear program. 
Some of the constraints embody geometric restrictions that a polyhedron of the given 
type must satisfy. Others of them are inequalities relating the weight function of the 
polyhedron, which is also a variable in the linear program, to the geometric quantities. 
The inequalities bound the score function on the cluster (either as a V-cell or as D-sets) 
in terms of these variables. There are also some global constraints in the linear program, 
for example that the solid angles of the faces around v add up to Att. 

(2) The weight function for D-sets does not permit subdivision of the simplex, but the weight 
function on the l^-cell is additive under subdivision, so one can cut such regions up into 
smaller pieces if necessary, to get improved linear programming bounds, by including 
more stringent constraints. 

(3) In the linear programming relaxation, a feasible solution to the constraints need not 
correspond to any geometrically constructible vertex D-star. All that is required is that 
every vertex D-star of the particular configuration type correspond to some feasible point 
of the linear program. 

In this fashion one obtains a long list of linear programs, one for each configuration type, 
and to rule out a map type Q one needs an upper bound for the linear program's objective 
function strictly below 8 pt. To rigorously obtain such an upper bound, it suffices to find a 
feasible solution to the dual linear program, and to obtain a good upper bound one wants the 
dual feasible solution close to a dual optimal solution. The value of the dual LP's objective 
function is then a certified upper bound to the primal LP. To obtain such a certification, it 
is useful to formulate the linear programs so that the dual linear program has only inequality 
constraints, with no equality constraints, so that the feasible region for it is full-dimensional. 
This way, one can guarantee that the dual feasible solution is strictly inside the dual feasible 
region which facilitates checking feasibility. This is necessary because the linear program 
put on the computer is only an approximation to the true linear program. For example, 
certain constraints of the true LP involve transcendental numbers like vr, and one considers an 
approximation. The effect of these errors is to perturb the objective function of the dual linear 
program. Thus a rigorous bound on the effect of these perturbations on the upper bound can 
be obtained in terms of the dual feasible solution. In this way one can (in principle) get a 
certified upper bound |^ on the score for a map type Q, using a computer. 

The linear programming bounds in the Ferguson-Hales approach above suffice to eliminate 



all map types Q not ruled out by Theorem 5.2 except for about 100 "bad" cases. These are 



then handled by ad hoc methods. [I am not sure of the details about how these remaining 

^"The Hales proof in the preprints used a linear programming package CPLEX that does not supply such 
certificates. Therefore the linear programming part of the Ferguson-Hales proof needs to be re-done to obtain 
guaranteed certificates. 
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"bad" cases are handled. Presumably they are split into smaller pieces, extra inequalities are 
generated somehow, and perhaps specific information on the location of vertices more than |gg 
is incorporated into the linear programs.] 



6. Concluding Remarks 

The Kepler conjecture appears to be an extraordinarily difficult nonlinear optimization 
problem. The "configuration space" to be optimized over has an extremely complicated struc- 
ture, of high dimensionality, and the function being optimized is highly nonlinear and non- 
convex, and lacks good monotonicity properties. The crux of the Hales approach is to select 
a formulation of an optimization problem that can be carried out (mostly by computer) in 
a reasonable length of time. This led to the Hales-Ferguson choice of an very complicated 
partition and score function, giving an inelegant local inequality, which however has good de- 
composition properties in terms of the nonlinear program. Much of the work in the proof lies 
in the reductions to reasonable sized cases, and the use of linear programming relaxations. 



The elimination of the most complicated cases, in Theorem 5.2 was a major accomplishment 
of this approach. The use of Delaunay simplices to cover most of the volume where density is 
high seems important to the proof and to the choice of score functions, since simple analytic 
formulae are available for tetrahedra. The Hales - Ferguson proof, assumed correct, is a tour 
de force of nonlinear optimization. 

In contrast, the Hsiang approach formulates a relatively elegant local inequality, involving 
only Voronoi domains and a fairly simple weight function: only nearest neighbor regions are 
counted. It is conceivable that a rigorous proof of the Hsiang inequality can be established, 
but it very likely will require an enormous computer-aided proof of a sort very similar to the 
Hales approach. Voronoi domains do not seem well suited to computer proof: they may have 
40 or more faces each, and the Hsiang approach requires considering up to twenty of them at 
a time. A computer-aided proof would likely have to dissect the Voronoi domains into pieces, 
further increasing the size of the problem. 



Acknowledgments. I am indebted to T. Hales for critical readings of a preliminary version, 
with many suggestions and corrections. G. Ziegler provided comments and corrections. 
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Appendix A. Hales Score Function Formulas 



These definitions are taken from in Hales pP-I| , Section 8] and Ferguson and Hales [FH, p 



8-11]. A tetrahedron T(/i, Z2, ^3, ■ ■ ■ , k) is uniquely determined by its six edge lengths k. Let 
the vertices of T be vq, vi, V2, V3 and number the edges as 

/i = ||v-Vj|| for 1 < i < 3, ^4 = ||v2 - V3II, ^5 = ||vi - V3II and /e = ||vi - V2II . (6.1) 

We take vq = for convenience. 

Suppose that the circumcenter Wc = Wc(T) of T is contained in the pointed cone over 
vertex vq, determined by T. Let Tq denote the part of the Voronoi cell of vq with respect to 
the set $7 = {vo,vi, V2V3} of vertices of T that lies in T. Suppose in addition that the three 
faces of T containing vq are each non-obtuse triangles. Then the set Tq subdivides into six 



pieces, called Rogers simplices by Hales |SP-I| , p. 31]. A Rogers simplex in T is the convex 



hull of Vq, the midpoint of an edge emanating from vq, the circumcenter of one face of T 
containing that edge, and the circumcenter Wc = Wc(r). If a denotes the half-length of an 
edge, b the circumradius of a face and c = ||yc|| is the circumradius of T then the associated 
Rogers simplex has shape 

R{a, b, c) := T{a, b, c, (c^ - b')'/', (c^ - a')'/', {b^ - a'f'^) , (6.2) 

with the positive square root taken. The intersection of a unit sphere centered at with 
R{a,b,c) has volume |S'o/(vo; i?(a, 6, c)), where 5o/(vo; ii(a, 6, c)) denotes the solid angle of 
R{a, 6, c) at vq, normalized so that a total solid angle is Air. Set 

Xi = If (6.3) 

are the squares of the edge lengths. 

Lemma 6.1 The solid angle SoI{vq,T) of a tetrahedron T(Zi, ^2, ^3, ^4, ^5, ^e) 'is given by 

Sol{^ro,T) := 2 arccot (6.4) 

in which the positive square root of A is taken, the value of arccot lies in [0,7r], and 

A{h,l2,h, k, k, k) := hhh + \h{ll + ll- ll) + \h{ll + ll - ll) + \h{ll + ll- ll) (6.5) 
and 

ilil/ ;2 I 72 I 72 ;2 i ;2 



^(^1) ^2) ^3) ^4) ^5) ^e) '■— hk{~h + h + l-i~ k + h~ h) 

ll) 



"^2^5(^1 ~ ^2 + ^3 + ^4 ~ ^5 + ll^ 



+^3^1(^2 + ^2 ~ ^3 + ^4 + ^5 ~ ll) 

—hhh ~ hhh ~ hhh ~ hhh • (6-6) 

Definition A.l. (i) for a tetrahedron T(/i, h, ■ ■ ■ , h) with vertex vq, if the circumcenter Wc 
of T falls inside the cone determined by T at vq, then we set 

z;or(T,vo) := 4 ^ |t;oZ(i?,(o, 6, c))(-5orf) + vo)| (6.7) 
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with 

vol{R{a,h,c)) := ^ ^- for \<a<h<c. (6.8) 

6 

This formula satisfies t'or(T, vq) = 4r(To). 

(ii) The six tetrahedra Ri{a,b,c) are stiU defined even when the circumcenter Wc falls 
outside the cone of T at vertex vq, and we still take the formula (6.7) to define vor{T, vq), except 



that both vol{Ri{a,b,c)) and Sol{Ri,vo) are counted with a negative sign: each tetrahedron 
Ri{a,b,c) falls outside T, and has no interior in common with it. 

Hales calls the definition (ii) the "analytic continuation" of case (i). It has a geometric 
interpretation. 

The truncated Voronoi function vor{T, vq; t) of a tetrahedron T at vertex vq is intended to 
measure the compression r(ro n -B(vo; t)). Here we have truncated the region Tq by removing 
from it all points at distance greater than t from vq. We set 

voroiT, vo) := vor ^T, vq, . (6.9) 

The definition 

vor{T, vo; t) := T{fo D S(vo; t)) (6.10) 

is valid only when the circumcenter Wc of T lies in the cone generated from T at vertex vq. 
In the remaining case one must construct an analytic representation analogous to (|6.7D for 



vor{T,VQ;t). This is done in [FH, pages 9-10]. 
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Appendix B. References to the Hales Program Results 



This paper was written to state the Hales-Ferguson local inequality in as simple a way as 
I could find, and does not match the order in which things are done in the preprints of Hales 
and Ferguson. Also, the lemmas and theorems stated here are not all stated in the Hales and 
Ferguson preprints; some of them are based on the talks that Hales gave at IAS in January 
1999. The pointers below indicate where to look in the preprints for the results I formulate as 
lemmas and theorems. Warning: The Hales-Ferguson partition and scoring function given in 
which are the ones actually used for the proof of the Kepler conjecture, differ from those 



FH 



used earlier by Hales in |SP-I| and ||SP-Hf| . 



(0) The idea of considering local inequalities that weight total area and covered area by 
spheres in a ratio ^ that is not equal to the optimal density occurs in Hales' original 



approach based on Delaunay triangulations, see |H1] [|H2|] . It also appears in Hales [3P-I, 
Lemma 2.1] and in Ferguson and Hales [FH, Proposition 3.14]. I have inserted the 



parameters A and B in order to include the density inequality of Hsiang|Hs| in the same 
framework. 



(1) Definitions 4-1 md J^.i appear in pH, page 2]. 



(2) Lemma 4-1 is Lemma 1.2 of pH| ], proved in Lemma 3.5 of | SP-]| ]. (The fact that no vertex 
of Q occurs inside a face of a QL-tetrahedron or a Q-R-tetrahedron requires additional 
argument.) 



(7) 



(8) 



(3) Lemma \4-^ follows from Lemma 1.3 of 



(4) Lemma ^.5 is proved in [FH, p. 3 bottom]. 



(5) Lemma 4-4 is covered in the discussion on [|FH|, pages 5-6]. 



(6) Lemma (i) -(iil) are covered in the discussion on pH| , page 5], including Lemma 1.8 
of that paper. 



The notion of "tip" is discussed at length in section 2 of Hales [3P-H|. In part II "tips" 
are not actually reassigned- although this is mentioned - their existence affects the scoring 
rule used for the associated Delaunay simplex which the "tip" is associated to. The rules 
for moving "tips" around to make T^-cells in the Hales-Ferguson approach are discussed 



on [FH, page 8]. Warning: the way that "tips" are handled in part II and in [FH[ may 
not be the same: [ p^H] ] takes priority. 



Lemma \4.f^ (i) is [ 3P-H , Lemma 2.2] and [FH, Lemma 4.17]. Facts related to (ii) are 
discussed in jSP-I , Sect. 8.6.7]. (For the second part I do not have a reference.) (iii) 
Hales mentioned this in IAS lectures, and sent me a proof sketch, which I expanded 
into the following: Let 5 be a simplex in the D-system that overlaps a "tip" protuding 
from V. Say that the "tip" overlaps by pointing in to S along a face F of S. Thus 
is a negatively oriented face of S' = {F,v), which means that the simplex S' is a 
QiJ-tetrahedron or else a QL-tetrahedron with spine on F. Suppose first that S' is a 
Q-L-tetrahedron. It now follows that S must be QL-tetrahedron with its spine on F by 
[ PH| , Lemma 2.2]. So 5' and S are adjacent QL-tetrahedra with spines on their common 
face F. Now S is in the D-system since S' is in the D-system. Thus the distance of v to 
the vertices in F is at most since v is not on the spine. We now suppose that the 
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"tip" is not entirely contained in S, and derive a contradiction. If it isn't contained in 
S, then it crosses out through a face F' of S. By the same argument, the distance from 
V to the vertices of F' is at most Thus v has distances at most 1^ from all vertices 
of S, which is impossible by |FH, Lemma 1.2] and |FH, Lemma 1.3]. Suppose secondly 
that S' is a Qi^-tetrahedron. Then one shows that S is also a Q-^'tetrahedron, hence is 
in the D-system. The rest of the argument goes as before, to the same contradiction. 



(9) Theorem 5.1. The main theorem is first stated as Conjecture 3.15 in [FH, p. 13]. It is 
the theorem asserted to be proved in ||K( 



(10) Lemma 5A (i) appears as |FH, Lemma 3.13]. (ii) is a special case of [FH, Lemma 3.13] 
for a quad cluster, which can consist of four congruent Q-^^-tetrahedra. 



(11) The standard regions corresponding to the graph ^(v) are defined on [FH, p. 4]. ("Planar 
map that breaks unit sphere into regions.") 



(12) Lemma 5J follows from Lemma 1.6 of [FH[, which implies that crossing lines come from 
Q-L-tetrahedra only. 



(13) Lemma 5^ is an immediate consequence of Lemma p.2| and [FH, Lemma 1.3]. 



(14) Lemma 5.4 appears as [FH, Lemma 3.13]. 



(15) Theorem 5A follows from the Corollary to Theorem 4.4 of [ SP-IV [. See also Proposition 
7.1 of pPJH|] . 



(16) Lemma 5J. and Lemma 5J.. These results are briefly stated at the bottom of p. 8 of 



There are also some relevant details in Hales [ SP-II , Sect. 2.2]. (I do not know an 
exact reference for detailed proof.) 





Hales-Ferguson terminology 


Terminology in this paper 


(1) 


decomposition star 


vertex D-star 


(2) 


quasiregular tetrahedron 


Qi?-tetrahedron 


(3) 


quarter 


Q-^^-tetrahedron 


(4) 


diagonal (of quarter) 


spine (of QL-tetrahedron) 


(5) 


Q-system 


Z)-system 


(6) 


score cr{R, v) 


weight function a{R,v) 


(7) 


standard cluster 


cluster 
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